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Abstract 

Recently, Diaconis, Ram and I created Markov chains out of the coproduct-then-product 
operator on combinatorial Hopf algebras. These chains model the breaking and recombining of 
combinatorial objects. Our motivating example was the riffle-shuffling of a deck of cards, for 
which this Hopf algebra connection allowed explicit computation of all the eigenfunctions. The 
present note replaces in this construction the coproduct-then-product map with convolutions of 
projections to the graded subspaces, effectively allowing us to dictate the distribution of sizes 
of the pieces in the breaking step of the previous chains. An important example is removing 
one “vertex” and reattaching it, in analogy with top-to-random shuffling. This larger family of 
Markov chains all admit analysis by Hopf-algebraic techniques. There are simple combinatorial 
expressions for their stationary distributions and for their eigenvalues and multiplicities and, in 
some cases, the eigenfunctions are also calculable. 

Recemment, avec Diaconis et Ram, nous avoirs construit des chaines de Markov a partir de 
I’operateur “coproduit-puis-produit” defini sur un algebre de Hopf combinatoire. Ces chaines 
modelisent la deconstruction et la construction d’objets combinatoires. La motivation etait 
le “melange a I’americaine”, une methode populaire pour melanger un jeu de cartes, pour 
lequel les liens avec les algebres de Hopf combinatoires nous a permis de calculer explicitement 
toutes les fonctions propres. Id, on generalise cette construction en remplagant I’operateur 
“coproduit-puis-produit” par les convolutions de projections sur les composantes graduees de 
I’algebre. Ceci nous permet de stipuler les tallies des pieces dans la decomposition des objets 
combinatoires. Un example important est la suppression et I’insertion d’un “sommet”, par 
analogie avec la bibliotheque de Tsetlin. On constate que toutes ces chaines peuvent etre 
analysees par des techniques provenant de la theorie des algebres de Hopf combinatoires. On 
prouve des expressions combinatoires simples pour les distributions stationnaires ainsi que pour 
les valeurs propres et leurs multiplicites. Dans certains cas, il est possible de calculer les fonctions 
propres associees. 

This version: March 31, 2015. This extended abstract was accepted as a talk for the 27th In¬ 
ternational Conference on Formal Power Series and Algebraic Combinatorics (FPSAC) in Daejoen, 
South Korea, in July 2015, and is published in a proceedings volume of Discrete Mathematics and 
Theoretical Computer Science (DMTCS). 

1 Introduction. 

Background: Markov chains from Hopf-powers 

Possibly the most popular model of card-shuffling is the Gilbert-Shannon-Reeds (GSR) riffle-shuffle: 
cut the deck into two piles according to the (symmetric) binomial distribution, then drop one-by- 
one the bottom card from either pile, chosen with probability proportional to the current pile size. 
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This second step is equivalent to all interleavings of the two piles (counted with multiplicity) being 
equally likely. 

Amongst the pleth ora of results concerning this shuffle, the most notable must be the work 
of (Bayer and Diaconis 19921 ). who determined that |logn shuffles are necessary and sufficient to 
randomise a deck of n cards. Central to their argument is the generalisation of the GSR model to 
a-handed shuffles: cut the deck into a piles according to a (symmetric) multinomial distribution, 
then drop the bottom cards from a pile chosen with probability proportional to pile size as before. 
Performing the GSR shuffle t times is then the same as a 2*-handed shuffle, so analysing long-term 
behaviour of 2-handed riffle-sh uffles is equivalent to letting the number of hands tend to infinity. 

(Diaconis, Pang, and Ram 20141 ) observed that the transition probabilities of the riffle-shuffle 
are, up to scaling, the coefficients of the coproduct-then-product map mA on the shuffle algebra. 
Furthermore, the coefficients of the ath Hopf-power map on the shuffle algebra give the 

transition probabilities of the o-handed riffle-shuffle. We then defined Markov chains on the bases 
of other combinatorial Hopf algebras by setting their transition probabilities to be such Hopf- 
power coefficients, with a little modificiation (via the Doob transform). These Hopf-power Markov 
chains model the breaking th en rec ombining of the combinatorial objects indexing the bases of the 
algebras. The thesis of (Pang 120141 ) greatly extends the Hopf-power Markov chain framework; this 
encompasses a restriction-then-induction chain on representations of the symmetric groups, and a 
tree-pruning model - see Examples 13.51 and 15.31 below. 


Probabilistic conclusions from Hopf-algebraic techniques 


The benefit of this viewpoint is two-fold. Firstly, as recorded by (Pang [201J, Th. 4.7.1), maps 
betw een Hopf algebras which “respect the bases” induce projections of the related chains. (Pang 
2013l j applied this to a map from the shuffle algebra to the algebra of quasisymmetric functions, to 
conclude that the positions of descents under riffle-shuffling of a deck of distinct cards is a Markov 
statistic. (A descent occurs where a card has greater value than the card immediately below it.) 
This means that the probability of a shuffle producing descents in prescribed positions depends 
only on the positions of descents before the shuffle, and not on the exact deck orde r. (Th is fact also 
follows from the d escent set being a “shuffle-compatible statistic”, which (Gessel ES) attributes 
to Stanley.) (Pang [20141 . Sec. 4.7) constructs many Markov statistics for inverse riffle-shuffling out 
of commutation quotients of the free associative algebra. 

The second way in which the Hopf formulation aids in studying these Markov chains is that, in 
many cases, there are algorithms to compute a basis of eigenvectors for the Hopf-power maps and 
hence the transition matrices. T his gi ves interesting information about the long term behaviour of 
the chain. As an example, (Pang l2014l . Prop. 6.1.3 and Prop. 6.1.5) state that, if a deck of n distinct 
cards was originally in ascending order, then, after t iterations of the a-handed shuffle, the expected 
number of descents is (1 — a“*)^^, and the expected number of peaks is (1 — (A peak 

is a triple of adjacent cards with the middle one having greatest value.) Although the algorithms 
do not provide all eigenvectors of all Hopf-power Markov chains, their stationary distributions are 
always computable. 


A new extension: Markov chains from convolutions of projections 


As (Pang[201J, Th. 4.4.1) shows, the breaking step of a Hopf-power Markov chain always involves 
a symmetric multinomial distribution. However, it is sometimes more natural to consider other 
distributions. For example, the restriction-then-induction chain mentioned above is difficult to 
express in terms of partitions, because the restriction of a symmetric group representation to a 
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multinomially-chosen Young subgroup involves Littlewood-Richardson coefficients. Simpler is the 
Markov chain which removes a random corner box and re-inserts it in a random position. 

The discovery in this extended abstract is that, by replacing in the definition of a 

Hopf-power Markov chain with a non-negative convolution of projections (Dehnition 12. 3 j) . one can 
change the piece sizes in the breaking step to have any desired distribution. For example, the 
“remove and re-insert a box” chain above comes from the map m(Proj;^ (8)i)A, using exactly the 
same Doob transform, (l is the identity map and Proj^ is projection to the subspace of degree 
1.) On the shuffle algebra, m(Proj]^ (8>t)A defines the much studied top-to-random shuffle', take 
the top card off the deck, then re-insert it at a uniformly chosen position. T he ot her non-negative 
convolutions of projections recover the shuffles of (Diaconis, Fill, and Pitman [l992l ). where the deck 
is cut in some specified distribution, and then the cards dropped one by one from the bottom of 
piles chosen with probability proportional to pile size. Their Corollaries 5.1 and 5.2 are a formula 
for the composition of such shuffles, and an upper bound for the mixing t ime. The inverses of such 
shuffles are examples of pop shuffles of (Bidigare, Hanlon, and Rockmore 19991 ) . so the eigenvalues 
and multiplicities can be calculated with their hyperplane walk theory. 

This new class of Markov chains admit analysis by the same techniques as for Hopf-power 
Markov chains. Maps between Hopf algebras “respecting the bases” again induce projections of 
their associated chains fTheorem 14.11 below). Consequently, the descent set is a Markov statistic 
under all these shuffling schemes. Existing literature on convolutions of projections provides the 
eigenvalues and multiplicities of these transition matrices. In some cases, there are eigenbasis 
algorithms resembling those for the Hopf-power chains. Once again, the stationary distributions of 
all these chains are accessible - they are precisely the same as those for the Hopf-power chains. 

One notable shuffle outside this framework is random-to-random: uniformly choose a card 
to remove from the deck, and re-insert it in a uniform position. Its defining linear map is an 
interesting operator on other combinatorial Hopf algebras, and it would be great to find a probability 
interpretation. 

This extended abstract is organised as follows: Section [2] gives the conditions on the two main 
characters in this story, the state space basis of a combinatorial Hopf algebra and the non-negative 
convolution of projections map. Section [3] explains how to construct the Markov chain via the 
Doob transform. Section 0] states the three main theorems: how morphisms of Hopf algebras lead 
to Markov statistics; the eigenvalues of the transition matrices and their multiplicities; and the 
common stationary distributions. Section [5] shows one scenario where explicit eigenbasis formulae 
are available, and gives probability applications both for shuffling and for a chain on trees. 

Acknowledgements: I would like to thank Thomas Lam and Nathan Williams for inspiring this 
project, and Nantel Bergeron, Persi Diaconis, Ian Grojnowski, Philippe Nadeau and Franco Saliola 
for many helpful discussions. The comments from the referees, especially the detailed pointers to 
the literature, are much appreciated. 


2 Combinatorial Hopf Algebras and the Convolution Product 

The starting point of our Markov chain construction is a combinatorial Hopf algebra, which encodes 
the breaking and combining rules for our family of combinatorial objects. An instructive example 
is the shuffle algebra, whose associated Markov chains describe various models of shuffling. 

Example 2.1. The shuffle algebra has a basis B of words, which we will think of as decks of cards. 
For example, the word accb will denote the deck with card a on top, followed by two copies of card 
c, and card b on the bottom. This algebra is graded by the lengths of the words, or the number of 
cards in the deck. The product of two words is the sum of all their interleavings (with multiplicity), 
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and the coproduct of a word is the sum of all its deconcatenations. For example, 

m{ac ( 8 ) cb) = 2accb + acbc + cacb + cabc + cbac; 

A{accb) = 1 8 > accb + a 8 ccb + ac 8 c 6 + acc 8 6 + accb 8 1. 


The exposition of (Grinb erg an d Reiner 12014^ gives background on combinatorial Hopf algebras, 
and the opening of (Foissy 2012l l contains an extensive list of examples with references. Like 
many recent treatments, these focus on generalisations of the symmetric functions, which, though 
extremely impo rtant, are not so integral to the present Markov chain application. The thesis of 
(Klausner 120111 . Sec. 4) is closer to the viewpoint herein. 

There is no rigorous definition of a combinatorial Hopf algebra. The intuition is that such 
an algebra Ti should have a basis B indexed by a family of combinatorial objects, graded by their 
sizes. (Assume throughout that the ground field is Q or M, to facilitate the probability applications.) 
Write T-Ln for the subspace of of degree n, so = 0 Tin- Since the empty object is the only object 
of size 0, the vector space is connected, i.e. dimT^o = 1- Now equip with a multiplication map 
m : "Rj 8 —>■ T-Li+j- for w,z G B, set m{w 8 z) to be the sum (possibly weighted) of all possible 

results from “combining” w and Similarly, the coproduct map A : Tin 0^=0 T^n-i takes 
X G B to the sum (possibly weighted) of tc 8 ^ over all pairs {w, z) obtainable by “breaking” x. 
There are various axioms that these operations must satisfy. 

The above combinatorial interpretation of the product and coproduct motivates that the struc¬ 
ture constants be non-negative (conditions i and ii below). This is crucial for the transition prob¬ 
abilities in Definition 13.11 to be non-negative. As we will discover in Section [3l another important 
hypothesis for the construction of the associated Markov chains is that A(x) 7 ^ I 8 x + x 8 l for 
any x G B oi degree greater than 1. Intuitively, it mandates that “every combinatorial object of 
size greater than 1 can be broken into strictly smaller pieces”. Combining these criteria leads to 
the following dehnition: 


Definition 2.2. (Pang l2014l . Def. 4.3.3) A basis B = of a graded connected Hopf algebra 

% = 0 ^ T-iri is a state space basis if: 

i) for all w,z G B, the expansion of m{w 8 z) in the B basis has all coefficients non-negative; 

ii) for all x G B, the expansion of A(x) in the B ®B basis has all coefficients non-negative; 

iii) for n > 1, the basis Bn contains no primitive elements. That is, A(x) 7 ^ I 8 x + x 8 l for all 
x G Bn with n > 1. 

Note that % may contain primitive elements of high degree, so long as they are not in the basis 

B. 

Having hxed a combinatorial Hopf algebra and a degree n, the next step is to choose the distri¬ 
bution of sizes of pieces in the breaking step of the Markov chain. As Step 1 of the interpretation 
in Section [3] will show, these distributions are in bijection, up to scaling, with the class of non¬ 
negative convolutions of projections, as defined below. For example, the operator Proj^^ * Pi'ojd 2 
will constrain the first piece to be of size exactly di, and the second piece to be of size exactly d 2 - 
Given maps f, g : % ^ their convolution product f * g : ^ Ti is the composition 

f*g-= m{f^g)A. 

Since the product m on 'H is associative and the coproduct is coassociative, the convolution product 
is associative, and it is useful to view 

fi*---* fa as (/i 8 • • • 8 /a) . 
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Here H and A[“] : H are the a-fold product and a-fold coproduct respectively, 

describing the combining of a objects and breaking one object into a pieces (some of which may 
be trivial). So = m and A^l = A. (A precise definition, by induction, is as follows: := l, 

mH := ig) i); A^^l := l, At“l := (i ® • (8) t ® A)A1““^].) 

The present Markov chain application uses only the case fi = Proj^., the projection to the 
subspace of degree d,. In other words, Proj^ : T-L —>■ T-Ld is the linear map satisfying Proj^(x) = x 
for X G and Proi^(?^,) = 0 if i 7 ^ d. It will be convenient to allow the case d = 0. 

(Patras 19941 ) studied linear combinations of these convolutions of projections. He called them 
descent operators, since his Theorem H.7 shows that, on a commutative or cocommutative Hopf 
algebra, these operators form an algebra under composition isomorphic to the descent algebra of the 
symmetric group. This connection is central to the eigenbasis algorithms for the present Markov 
chains, which come from the following subset of these operators: 


Definition 2.3. Let 'H be a graded connected Hopf algebra, and fix an integer a > 2. A map 
T : Ti ^ 7i of the form 

T := ^ Projrf^ * • • • * Proj^^ 

di ,...,da 

is a non-negative convolution of projections on 'Hn if 

i) for all weak-compositions D := {di,... , da) of n (that is, dj > 0 and Y^di = n), the coefficients 
ud are non-negative; 


ii) for some weak-composition D of n where each dj 7 ^ n, the coefficient is positive. 


The second axiom ensures that T : T-Ln Hn is not merely multiplication by a constant. Note 
that the map T does not uniquely determine the coefficients an, because of the possibility of 
parts of size 0. Different choices oi an lead to different interpretations of the same Markov chain. 
One final remark on this definition: it is fine for infinitely many a/) to be non-zero, as the image 
T(x) of any particular x € 'H is always a finite sum. This is because there are only finitely many 
weak-compositions of a given integer into exactly a parts. 

One key example of a non-negative convolution of projections is the ath Hopf-power map 
m[“lA[“]. This is the ath convolution power of the identity map, so it corresponds to setting 
«£) = I for all weak-compositions D. As the three-step interpretation of Section [3] will explain, the 
associated Markov chains have a sym metric multinomial breaki ng step. This is the case previously 
studied in (Diaconis, Pang, and Ram lioi^ ) and in (Pang 2014 ). Another important specialisation 
comes from otii n-i) = 1, oid = 0 if D 7 ^ (1, n — 1) for any n, so T = Projj^ *i. This map produces 
Markov chains which break off a singleton and reattach it, analogous to the top-to-random shuffle 
of the introduction. 

More examples are at the end of the next section. 


3 Building The Markov Chain 

The following fact is the main motivation for the dehnition of a Markov chain for each non-negative 
convolution of projections: the probability that a riffle-shuffle takes a deck x of n cards to a deck 
y is the coefficient of y in ^mA(x). In other words, the transition matrix of the riffle-shuffling of 

n cards is [^mA]^ , the transpose of the matrix of the linear operator ^mA with respect to the 
basis Bn of words. A similar direct calculation shows that the top-to-random shuffle of n cards has 
transition matrix [iProj]^*i]g . 
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These observations suggest defining the transition matrix to be 4T on other combinatorial 

-I Bn 

Hopf algebras, for other non-negative convolution of projections T and some appropriate number 
/3. However, such a matrix represents transition probabilities only when each of its rows sums 
to 1. In other words, this naive generalisation fails if the rows of [T ]^^ do not sum to the same 
number. One of the major findings of (Diaconis, Pang, and Ram l2014l . Th. 3.4) is that, when T is a 
Hopf-power map T = and H is a state spa ce bas is, then it is possible to define a rescaling 

H of H so the row sums of [T]g are equal. (Pang 120141 . Sec. 4.3) gives a much slicker and more 
general description of this rescaling, in terms of the Doob trans form. This allows a generalisation 
to linear maps on % that are not the Hopf-power. Indeed, fPang [2014l . Th. 3.1) describes the choice 
of rescalings H of H that are available for arbitrary linear maps. 

It happens that the standard rescaling for the Hopf-power maps also applies to non-negative 
convolutions of projections; indeed, the rescaling necessary to construct a T-Markov chain depends 
only on the underlying Hopf algebra, and not on the map T. This resulting Markov chain is: 


Definition 3.1. Let % = ©n>o^n be a graded connected Hopf algebra with state space basis B. 
For X € Bm let rj^x) denote the sum of the coefficients (in the basis) of Projf” A["'](x), and let 


B„. := 


X 


r]{x) 


X € Br, 


Let T = ^ an Proj^^ * • • • * Proj^^ be a non-negative convolution of projections on T-Ln- Then the 
T-Markov chain on B„ has transition matrix 


—T 

Pn 


where f^n := 


O-D 


dlH- \-da=n 


n 


di...dc 


IS never zero. 


In the shuffle algebra. 


The hypotheses of a state space basis ensure that r]{x) 
r]{x) = 1 always, so no rescaling is necessary. 

Expressing the above transition probabilities in terms of a “natural” process requires careful 
analysis of the underlying Hopf algebra. Fortunately, one only needs t o do t his once for each Hopf 
algebra to interpret all its T-Markov chains, as an analogue of fPang l2014l . Th. 4.4.1) shows that 
these have the form: 


1. Choose a weak-composition (di,... ,da) of n with probability 

2 . Choose a way to break into pieces of sizes di,... ,da- 

3. Choose a way to combine these pieces. 

Here, the probabilities of the choices in steps 2 and 3 depend only on the Hopf algebra, not on 
T. (The exact expression s for t hese probabilities are unsightly and not instructive; the interested 
reader may consult (Pang [2014l . Th. 4.4.1).) For example, a T-shuffte (the T-Markov chain on the 
shuffle algebra) of n cards is the following: 

1 ,2. Cut the deck into piles of sizes (di,... ,da) with probability ■ 

3. Drop one-by-one the bottommost card from a pile chosen with probability proportional 

to the current pile size. 


Aside from the GSR riffle-shuffle, its a-handed generalisation, and the top-to-random shuffle, here 
are some additional notable T-shuffles: 
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Example 3.2. (Assaf, Diaconis, and Soundararaian l2012l l give the mixing time for shuffles with 
biased cuts, when the shuffler prefers to take more cards in one hand than the other. Here, the 
probability of cutting i cards off the top of a deck of n cards is the asymmetric binomial, ( 7*(1 — 
some parameter q € (0,1). The associated non-negative convolution of projections is 


T = 


* ProJi * Proj„_i 


i=0 


This has an obvious a-handed generalisation, with a — 1 parameters. Setting all parameters to ^ 
then recovers the a-handed riffle-shuffle (even though the associated non-negative convolution of 


projections is then a 


*, as these Markov chains depend on T only up to scaling). 


Example 3.3. (Diaconis, Fill, and Pitman Il992l . Sec. 2 and Sec. 6 , Ex. 2 ) discuss two notions of 
top-m-to-random shuffles: T = Proj^ *l corresponds to cutting off m cards and re-inserting them 
randomly, keeping their relative order, whilst T = Projj^™' *l cuts m cards off and inserts them 
randomly in any order. For both schemes, they show that ^ log n shuffles suffice to randomise the 
deck. 

Example 3.4. Taking T = Proj^ *l + l* Proj^ produces a shuffle where the pile sizes are (1, n — 1) 
or (n — 1,1), each with probability In other words, flip a fair coin, and perform a top-to- 
random shuffle if the coin shows heads, and a bottom-to-random shuffle if it sh ows ta ils. This is 
the (symmetric) top-or-bottom-to-random shuffle of (Diaconis, Fill, and Pitman Il992l . Sec. 6, Ex 
4). It is easy to introduce an asymmetry here: for q £ [0,1], take T = gProj^ -|- (1 — q)i * Proj^. 
Setting q = 1 then recovers the top-to-random shuffle. Theorem 15.11 below exhibits an eigenbasis 
for this map on cocommutative Hopf algebras. 

The Markov chains coming from the above choices of T, on other Hopf-algebras, are also 
interesting. 

Example 3.5. The irreducible representations of the symmetric groups form a basis of a Hopf 
algebra, with product being external induction, and c oproduct coming from restriction to Young 
subgroups. It is straightforward to adapt (Pang|201J, Ex. 4.4.3) to give the following description 
for each step of a T-Markov chain, starting from a representation x of &n- 

1. Choose a Young subgroup &di x • • • x of ©„ with probability • 

2. Restrict the starting state x to the chosen subgroup. 

3. Induce it back up to ©„, then pick an irreducible constituent with probability propor¬ 
tional to the dimension of its isotypic component. 

In particular, the Proj^ *i-chain is restricting to ©n-i) inducing back to then choo sing a n 
irreducible constituent as in step 3. This chain previously appeared in the work of (Fuhnan 2009l l. 
where it generates central limit theorems for character ratios. 

For a more involved example, see Example [5]3] regarding the (g Proj^ *i-|-(l —g)t*Proj]^)-Markov 
chain on trees. 
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4 Projection Theorem and Eigenvalue Multiplicities 


As outlined in the introduction, viewing Markov chains in this Hopf-theoretic framework gives two 
useful consequences. The first is the construction of Markov statistics from maps between Hopf 
algebras: 

Theorem 4.1. Let %, % he graded, eonnected Hopf algebras with bases B, B respectively. Suppose 
in addition that B is a state space basis. Let T be a non-negative convolution of projections. If 
6 : H ^ H is a Hopf-morphism such that 0{Bn) = Bn for all n, then the T-chain on Bn is the 
projection under 6 of the T-chain on Bn. 


It follows that 0 is a Markov statistic for the T-chain on Bn - this fact would be interesting by 
itself even if the projected chain were not identified as the T-chain on the target Hopf algebra. 

The second profit of the Hopf formulation is the following expression for all the eigenvalues and 
multiplicites of these Ma rkov chains, which shed some light on their lo ng te rm behaviour. It comes 


from combi ning ( Patras Il994l . Th. H.7), (Krob, Leclerc, and Thibon 119971 . Prop. 3.12 
and Pfeiffer 120081 . Prop. 3 .10) a nd the arguments of (Krob, Leclerc, and Thibon Il99 
and of (Aguiar and Lauve 20131 ). 


I 


(Bonnafe 
Th. 3.21) 


Theorem 4.2. Work in the setup of Definition \3.1{ Given a partition 
weak-composition D = {di,...,da), let {X, D) denote the number of set 
{1,2,...,/} such that, for each i € {!,... a}, we have ~ 

the inner product {px, hjo) of symmetric functions, hence the notation.) 
^T : Hn Hn O'fe 


A := (Ai,..., A/) and a 
partitions Bi \ ... \Ba of 
(So {X,D) is equal to 
Then the eigenvalues of 


Pn 


^J2^d{X,D) 

Dhn 


A h n 


and the multiplicity of the eigenvalue is the coefficient of xx := xxi ■ ■ ■ xxi in the generating 
function a(l — Xi) where bi satisfies 


^dimiHX = n(l 

n i 


Futhermore, T is diagonalisable if H is commutative or cocommutative. 

Note that agrees with the (In of Definition 13.11 so this is not a point of confusion. Here’s 
how this formula specialises to some key examples: 


Example 4.3. Let T = ProJi *l, the top-to-random map. Recall that this corresponds to a(i,n-i) = 
1, and all other an = 0. So, for all partitions A of n, it holds that /3 a = (A, (l,n — 1)), and this 
is the number of parts of size 1 in A, which can be 0,1, ..., n — 2, or n. So the eigenvalues of a 
top-to-random chain on any Hopf algebra are (lx/P(n) ~ n’ n’ • • • ’ 

_ In the case of the shuffle algebra, for a deck of distinct cards, (Diaconis, Fill, and Pitman 

19921 . Th. 4.1) show that the multiplicity of the eigenvalue - is the number of permutations with 


n — j fixed points, and find projection operators for each eigenspace (on the right). (Hivert et al. 


2011 


Sec. 4.6) produce an eigenbasis by associating each permutation with n — j fixed points to 
an eigenvector of eigenvalue I. The present Proj^ *i,-chain framework generalises this eigenbasis 
algorithm for decks with repeated cards; see the remark after Proposition 15.21 It follows from a 
multigraded refinement of Theorem 14.21 above that, for any T-shuffle of a deck of distinct cards, 
the multiplicity of the eigenvalue (Ix/I3(n) is the number of permutations of cycle type A. 
















Example 4.4. Take T = (gProj^ *i +(1 — ( 7 )t*Proj]^), the “asymmetric top-or-bottom-to-random” 
operator. Its eigenvalues are 

^ ^ g(A, (l,n - 1)) + (1 - g)(A, (n - 1,1)) 

/3n n 

Note that the definition of (A, D) depends only on the part sizes of the composition D and not on 
their order, so (A, (l,n — 1)) = (A, (n — 1,1)). Hence the eigenvalues of top-or-bottom-to-random 
are the same as for top-to-random in Example 14.31 above, with the same multiplicities. 

Using the multiplicity in Theorem 14.21 for the eigenvalue 1 = shows that the following 
expressions, which are easily shown to be linearly independent stationary distributions, do span 
the eigenspace of eigenvalue 1. 

Theorem 4.5. For a fixed state space basis B, all T-Markov chain on Bn have the same set of 
stationary distributions. These can be uniquely written as a linear combination of the functions 

.^ E coefficient of x in the product Co-(i) ... Co-(n) 


for each multiset {ci,... ,c„} in Bi. (Here, t]{x) are the rescaling constants of Definition \3.1\ ) 


As noted in (Pang l2014l . Th. 4.5.1), T^cx,...,cn{x) essentially enumerates the ways to build x out 
of Cl,..., Cn using the multiplication of the combinatorial Hopf algebra, and to then break it into 
singletons. In the case of card-shuffling, the unique stationary distribution for all T-shuflles is the 
uniform distribution. 


5 Eigenvectors and Applications 


Since the coefficients «£) of a non-negative convolution of projections can take any non-negative 
value, it’s not surprising that there is no neat universal eigenbasis algorithm for these chains. 
Howe ver, one case which works out nicely, thanks to the theory of dual graded graphs of (Fomin 


19941 . Th. 1.6.6), is the top-or-bottom-to-random chain of Example 13.41 


Theorem 5.1. Let H be a graded connected Hopf algebra, and V be a (graded) basis of its primitive 
subspace. Write V as the disjoint union Vi HP>i, where Vi has degree 1. Set 


E,,= 


EE 

2 = 0 <7G©7 


*c^(i) • • • c^(i) 


Pr(l) • • •Pr(fc) 




... 


(i) 


ranging over all multisets {ci,. .. ,Cj} ofVi, and all multisets {pi,... ,pk} of Vyi where degpi -|- 
• • • degpfc = n — j. Then Ej is a linearly independent set of eigenvectors of eigenvalue ^ for the 
operator A(qrProj]^ -|- (1 — q)i. * ProJi) : Hn —>■ TCn- Furthermore, if H is cocommutative, then 
H”Tg Ej IL En is a basis. 

Here are some simple applications of these eigenvectors to the top-or-bottom-to-random shuffle 
of a deck of distinct cards, analogous to the statements for riffle-shuffling in the fifth paragraph 
of the introduction. (The shuffle algebra is commutative, so its dual is cocommutative, and the 
eigenvectors of T on this dual give right eigenfunctions of the transition matrix, from which one 
deduces these results.) 
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Proposition 5.2. Let {Xt} denote the top-or-bottom-to-random shujfle, with parameter q, of a 
deck of n distinct cards. Suppose the starting deck Xq is in ascending order. Let Des(X) C 
{1,2,... ,n — 1} and Peak(X) C {1, 2,... , n — 2} denote the positions of the descents and peaks of 
X respectively. Then 

E^ctationi ^ 

{iePeak(Xt) ^ ^ J \ ^ ■' / 

Setting q = 1 in Theorem 15.II gives the eigenvectors for the top-to-random operator Proj^ *l as 

Co-(i) . . . Cg-Q) Pt{ 1) ■ ■ ■PT{k)- 


These are also the eigenvectors of Proj{"* */,, the unordered version of top-m-to-random as in Ex¬ 
ample [331 with eigenvalue (m)/(m)- reason is that, on a cocommutative Hopf algebra, the 
map Proj^™' *i is a polynomial in Proj^ *i. Similarly, the Ej in Theorem [5T] are eigenvectors of any 
polynomial in gProj]^ *l + {1 — q)L * Proj^. In particular, they have eigenvalue ^2 f^^e follow¬ 
ing ma p, cor responding to the trinomial-top-and-bottom-to-random shuffle of (Diaconis, Fill, and 
Pitman 1992 . Sec. 6, Ex 6): 


E 


miH-m2+m3=n 


milma! 




*mi 


*6 * Proj^ 


•*m3 


(Here, (?i + ^2 + 93 = Ij and the previous q is 


91 


91+93 


in terms of the new parameters.) 


To finish, here is an example away from the world of card-shuffling, to illustrate the diversity 
of Markov chains that this framework can analyse. 

Example 5.3. We will study the trinomial-top-and-bottom-to-random Markov chain (of two para¬ 
graphs prior) on the Connes-Kreimer Hopf algebra of rooted forests. Take as state space basis the 
set of all rooted forests - that is, a disjoint union of trees, each of which has a distinguished root 
vertex. (The vertices are unlabelled, and the embedding of the tree in the plane is immaterial.) 
The degree of a forest is its number of vertices. The product of two trees is their disjoint union, 
and the coproduct of a tree T is the sum of T\S (8> S over all connected subtrees S' of T which are 
either empty or contain the root of T. Hence l * Projj^ corresponds to removing a root of a forest , 
and Proj^ *i to removing a leaf. For a full definition of this algebra, see (Connes and Kreimer 19981 . 
second half of Sec. 2). 

The Hopf-power Markov chain on this algebra was the subject of fPang [20l3 . Sec. 5.3). Adapting 
Theorem 5.3.8 there gives the following description of the trinomial-top-and-bottom-to-random 
chain: 

Suppose a company has a forest structure, so all employees have at most one direct superior. 
All employees are either regular employees or VPs, and the superior of a VP is necessarily also a 
VP (so the VPs in each connected component of the company form a subtree containing the root). 

Each month, each regular employee independently produces excellent work with probability 93 , 
average work with probability 92 , and subpar work with probability 91 (where 91 -|-92 + 93 = 1-) For 
each employee who produced excellent work, one by one in a random order, the regular employee 
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furthest up the chain of superiority from him becomes a VP. Then the employees who produced 
subpar work are fired, one by one starting from the most superior. Each firing causes a cascade 
of promotions: first, someone further down the chain of superiority from the fired employee is 
uniformly selected to replace him. Then, if the promoted employee was superior to anyone, then 
one of those is uniformly selected and promoted to his position. This process continues until 
someone who is not superior to anyone is promoted. 

The chain keeps track of the structure of the regular employees, but does not know which 
employee is taking which position in the forest structure, nor does it see the structure of the VPs. 

The cases j = 2,3 below are analogues of Proposition 15.21 for this chain. These are inequal¬ 
ities, rather than equalities, because the exact eigenvectors in Theorem 15.11 are very complicated 

(involving a second sum), so to obtain a slicker result, we use instead the estimates fj. Defining 
these requires some more notation: for a regular employee u, let desc(u) (resp. anc(u)) denote the 
number of regular employees who are further down (resp. up) from u in the chain of superiority, 
including u himself in both counts. (In tree language, these are the descendants and the ancestors). 
Also, write n'{u) for the size of the connected component of regular employees containing u. 


Proposition 5.4. Let {Xt} denote the trinomial-top-and-bottom-to-random Markov chain on the 
Cannes-Kreimer Hopf algebra of rooted forests, with interpretation and notation as above. For each 
integer j > 2, define the following funetions on forests: 


uGT 


desc(ji) anc(it) /deSc('U.) 


% 


{ 


(The binomial coefficient is 0 if desc{u) < j.) Then 

{ 


Expectation I ^AXt) > < On^fAVn) max , , 

n ^ 3\ ^2 3\ ^\^Xo:desc(u)>j {y&nciu) -1 


n'(u) 
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